Abstract. We consider a "nonconventional" averaging setup in the form
$\frac{dX^\varepsilon(t)}{dt} = \varepsilon B\big(X^\varepsilon(t), \xi(q_1(t)), \xi(q_2(t)), \dots, \xi(q_\ell(t))\big)$ where $\xi(t)$, $t \ge 0$ is either
a stochastic process or a dynamical system (i.e. then $\xi(t) = F^t x$) with sufficiently
fast mixing while $q_j(t) = \alpha_j t$, $\alpha_1 < \alpha_2 < \dots < \alpha_k$ and $q_j$, $j = k+1, \dots, \ell$
grow faster than linearly. We show that the properly normalized error term in
the "nonconventional" averaging principle is asymptotically Gaussian.



1. Introduction 

Nonconventional ergodic theorems (see [12]), known also after [2] as polynomial
ergodic theorems, studied the limits of expressions having the form
$\frac{1}{N}\sum_{n=1}^{N} F^{q_1(n)} f_1 \cdots F^{q_\ell(n)} f_\ell$ where $F$ is a weakly mixing measure preserving
transformation, the $f_i$'s are bounded measurable functions and the $q_i$'s are polynomials
taking on integer values on the integers. Originally, these results were motivated
by applications to multiple recurrence for dynamical systems, taking the functions $f_i$
to be indicators of some measurable sets, and only convergence in the $L^2$-sense was
dealt with, but later [1] provided also almost sure convergence under additional
conditions. Recently such results were extended in [6] to continuous time dynamical
systems, i.e. to expressions of the form

$\frac{1}{T}\int_0^T F^{q_1(t)} f_1 \cdots F^{q_\ell(t)} f_\ell \, dt$

where $F^s$ is now an ergodic measure preserving flow.
In this paper we consider the averaging setup 

(1.1) $X^\varepsilon(n+1) = X^\varepsilon(n) + \varepsilon B\big(X^\varepsilon(n), \xi(q_1(n)), \dots, \xi(q_\ell(n))\big)$

in the discrete time case and 

(1.2) $\frac{dX^\varepsilon(t)}{dt} = \varepsilon B\big(X^\varepsilon(t), \xi(q_1(t)), \dots, \xi(q_\ell(t))\big)$

in the continuous time case, with $\xi$ being either a stochastic process or having the
form $\xi(s) = F^s f$ where $F^s$ is a dynamical system and $f$ is a function. Positive
functions $q_1, \dots, q_\ell$ will satisfy certain conditions which will be specified in the next
section; in particular, the first $k$ of them are linear while the others grow faster than the
preceding ones. An example where (1.2) emerges is obtained when we consider a time
dependent small perturbation of the oscillator equation

(1.3) $\ddot{x} + \lambda^2 x = \varepsilon g(x, \dot{x}, t)$

where the force term $g$ depends on time in a random way,
$g(x, y, t) = g(x, y, \xi(q_1(t)), \dots, \xi(q_\ell(t)))$. Then passing to the polar coordinates $(r, \phi)$ with
$x = r \sin(\lambda(t - \phi))$ and $\dot{x} = \lambda r \cos(\lambda(t - \phi))$ the equation (1.3) will be
transformed into (1.2). It seems reasonable that a random force may depend on versions
of the same process or dynamical system moving with different speeds, which is
what we have here.

As it is well known (see, for instance, [28]), if $B(x, y_1, \dots, y_\ell)$ is bounded and
Lipschitz continuous in $x$ and the limit

(1.4) $\bar{B}(x) = \lim_{T \to \infty} \frac{1}{T} \int_0^T B\big(x, \xi(q_1(t)), \dots, \xi(q_\ell(t))\big)\, dt$

exists then for any $S > 0$,

(1.5) $\lim_{\varepsilon \to 0} \sup_{0 \le t \le S/\varepsilon} |X^\varepsilon(t) - \bar{X}^\varepsilon(t)| = \lim_{\varepsilon \to 0} \sup_{0 \le t \le S} |Z^\varepsilon(t) - \bar{Z}(t)| = 0$

where

(1.6) $\frac{d\bar{X}^\varepsilon(t)}{dt} = \varepsilon \bar{B}(\bar{X}^\varepsilon(t))$ and $Z^\varepsilon(t) = X^\varepsilon(t/\varepsilon)$, $\bar{Z}(t) = \bar{X}^\varepsilon(t/\varepsilon)$.

In the discrete time case we have to take

(1.7) $\bar{B}(x) = \lim_{N \to \infty} \frac{1}{N} \sum_{n=0}^{N} B\big(x, \xi(q_1(n)), \dots, \xi(q_\ell(n))\big)$

and (1.5) remains true with $\bar{X}^\varepsilon$ given by (1.6) and (1.7). Almost everywhere limits
in (1.4) and (1.7) can be obtained by nonconventional pointwise ergodic theorems
from [6] and [1], respectively, in rather general circumstances in the dynamical
systems case, and under another set of conditions the existence of such limits follows
from [20].
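
The averaging principle (1.5) can be illustrated numerically in the discrete time setup (1.1). The following is only a minimal sketch under assumptions that are not taken from the paper: $\ell = 2$, $q_1(n) = n$, $q_2(n) = 2n$, the process $\xi$ is an i.i.d. sequence of signs $\pm 1$, and the right-hand side is the hypothetical function $B(x, a, b) = 1 - x + ab$, for which the limit (1.7) is $\bar{B}(x) = 1 - x$.

```python
import random


def simulate_averaging(eps, S=1.0, x0=0.0, seed=0):
    """Return sup over n < S/eps of |X(n) - Xbar(n)| for the recursion (1.1).

    Assumptions (illustrative only): xi is i.i.d. uniform on {-1, +1},
    q_1(n) = n, q_2(n) = 2n and B(x, a, b) = 1 - x + a * b.
    """
    rng = random.Random(seed)
    steps = int(S / eps)
    # xi(0), ..., xi(2 * steps): enough values to evaluate xi(2n) for n < steps.
    xi = [rng.choice((-1.0, 1.0)) for _ in range(2 * steps + 1)]

    def B(x, a, b):
        # Hypothetical right-hand side of (1.1).
        return 1.0 - x + a * b

    def B_bar(x):
        # Averaged field: the product xi(n) * xi(2n) averages out to zero.
        return 1.0 - x

    x, x_bar, worst = x0, x0, 0.0
    for n in range(steps):
        x = x + eps * B(x, xi[n], xi[2 * n])   # slow motion driven by xi(n), xi(2n)
        x_bar = x_bar + eps * B_bar(x_bar)     # averaged motion
        worst = max(worst, abs(x - x_bar))
    return worst


if __name__ == "__main__":
    for eps in (1e-2, 1e-3, 1e-4):
        print(eps, simulate_averaging(eps))
```

In this toy model the reported deviation should shrink as eps decreases, roughly on the scale of the square root of eps, which is the scale at which the Gaussian fluctuations studied in this paper appear.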

After nonconventional ergodic theorems (or in the probabilistic language laws
of large numbers) are established the next natural step is to obtain central limit
theorem type results, which was accomplished in [24]. The averaging principle (1.5)
can be considered as an extension of the ergodic theorem and the main goal of this
paper is to extend also central limit theorem type results to the above nonconventional
averaging setup in the spirit of what was done in the standard (conventional)
averaging case in [21] and [17]. Central limit theorem type results turn in the averaging
setup into assertions about Gaussian approximations of the slow motion $X^\varepsilon$
given by (1.1) or by (1.2) where $\xi$ is a fast mixing stochastic process or a dynamical
system while unlike the standard (conventional) case we have the process $\xi$ taken
simultaneously at different times $q_i(t)$ in the right hand side of (1.1) and (1.2).

We prove, first, our limit theorems for stochastic processes under rather general
conditions resembling the definition of mixingales (see [33] and [37]) and then check
these conditions for more familiar classes of stochastic processes and dynamical
systems. In [24] we imposed mixing assumptions in a standard way relying on two
parameter families of σ-algebras (see [5]) while our assumptions here use only
filtrations (i.e. nondecreasing families) of σ-algebras which are easier to construct for
various classes of dynamical systems. As one of applications we check some form of
our conditions for Anosov flows which serve as fast motions in our nonconventional
averaging setup where we rely on the notion of Markov families from [8] and [5].

At the end of the paper we discuss a fully coupled averaging setup in our nonconventional
situation where already an averaging principle itself becomes a problem.

Acknowledgment. A part of this paper was done during my visit to the Fields
Institute in Toronto in June 2011 whose support and excellent working conditions
I gratefully acknowledge.



2. Preliminaries and main results

Our setup consists of a $\wp$-dimensional stochastic process $\{\xi(t),\ t \ge 0 \text{ or } t = 0, 1, \dots\}$
on a probability space $(\Omega, \mathcal{F}, \Pr)$ together with a filtration of σ-algebras
$\mathcal{F}_l \subset \mathcal{F}$, $0 \le l \le \infty$ so that $\mathcal{F}_l \subset \mathcal{F}_{l'}$ if $l \le l'$. For convenience we extend the
definitions of $\mathcal{F}_l$ given only for $l \ge 0$ to negative $l$ by defining $\mathcal{F}_l = \mathcal{F}_0$ for $l < 0$.
In order to relax required stationarity assumptions to some kind of weak "limiting
stationarity" our setup includes another probability measure $P$ on the space $(\Omega, \mathcal{F})$.
Namely, we assume that the distribution of $\xi(t)$ with respect to $P$ does not depend
on $t$ and the joint distribution of $\{\xi(t), \xi(t')\}$ for $t \ge t'$ depends only on $t - t'$, which
can be written in the form

(2.1) $\xi(t)P = \mu$ and $(\xi(t), \xi(t'))P = \mu_{t-t'}$ for all $t \ge t'$

where $\mu$ is a probability measure on $\mathbb{R}^\wp$ and $\mu_s$, $s \ge 0$ is a probability measure on
$\mathbb{R}^\wp \times \mathbb{R}^\wp$.

Our setup relies on two probability measures $\Pr$ and $P$ in order to include, for
instance, Markov processes satisfying the Doeblin condition (see [16] or [10])
starting at a fixed point or with another noninvariant distribution. Then $\Pr$ will
be a corresponding probability in the path space while $P$ will be the stationary
probability constructed by the initial distribution being the invariant measure of
$\xi(t)$. Usual mixing conditions for stochastic processes are formulated in terms of a
double parameter family of σ-algebras via a dependence coefficient between widely
separated past and future σ-algebras (cf. [5] and [23]) but this approach often is not
convenient for applications to dynamical systems where natural future σ-algebras
do not seem to exist unless an appropriate symbolic representation is available.
For this reason we formulate below a different set of mixing and approximation
conditions for the process $\xi$ which seem to be new and will enable us to treat some
dynamical systems models within a class of stochastic processes satisfying our
assumptions.

In order to avoid some technicalities we restrict ourselves here mostly to
bounded functions though our results can be obtained for more general classes of
functions with polynomial growth supplemented by appropriate moment boundedness
conditions similarly to [24]. For any function $g = g(\xi, \tilde\xi)$ on $\mathbb{R}^\wp \times \mathbb{R}^\wp$ introduce
its Hölder norm $|g|_\kappa$ (formula (2.2)).

Here and in what follows $|\psi - \tilde\psi|^\kappa$ for two vectors $\psi = (\psi_1, \dots, \psi_{\mathfrak{d}})$ and
$\tilde\psi = (\tilde\psi_1, \dots, \tilde\psi_{\mathfrak{d}})$ denotes the sum $\sum_{i=1}^{\mathfrak{d}} |\psi_i - \tilde\psi_i|^\kappa$. Next, for $p, q \ge 1$ and $s \ge 0$ we
define a sort of a mixing coefficient

(2.3) $\eta_{p,\kappa,s}(n) = \sup_{t \ge 0} \big\{ \| E(g(\xi(n+t), \xi(n+t+s)) \,|\, \mathcal{F}_{[t]}) - E_P\, g(\xi(n+t), \xi(n+t+s)) \|_p : g = g(\xi, \tilde\xi),\ |g|_\kappa \le 1 \big\}$, $\eta_{p,\kappa}(n) = \eta_{p,\kappa,0}(n)$

where $\|\cdot\|_p$ is the $L^p$-norm on the space $(\Omega, \mathcal{F}, \Pr)$, $[\cdot]$ denotes the integral part and
throughout this paper we write $E$ for the expectation with respect to $\Pr$ and $E_P$ for
the expectation with respect to $P$. We will need also a (one-sided) approximation
coefficient

(2.4) $\zeta_q(n) = \sup_{t \ge 0} \| E(\xi(t) \,|\, \mathcal{F}_{[t]+n}) - \xi(t) \|_q$.

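2.1. Assumption. Given $\kappa \in (0, 1]$ there exist $p, q \ge 1$ and $m, \delta > 0$ satisfying

(2.5) $\gamma_m = E|\xi(0)|^m < \infty$, $\frac{1}{2} \ge \frac{1}{p} + \frac{2}{m} + \frac{\delta}{q}$, $\delta < \kappa - \frac{\mathfrak{d}}{p}$

with $\mathfrak{d} = (\ell - 1)\wp$ and such that

(2.6) $\sum_{n=0}^{\infty} n\big(\eta_{p,\kappa}^{1 - \frac{\mathfrak{d}}{p\theta}}(n) + \zeta_q^\delta(n)\big) < \infty$ and $\lim_{n \to \infty} \eta_{p,\kappa,s}(n) = 0$ for all $s \ge 0$,

where $\frac{\mathfrak{d}}{p} < \theta < \kappa$.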

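Next, let $B = B(x, \xi) = (B^{(1)}(x, \xi), \dots, B^{(d)}(x, \xi))$, $\xi = (\xi_1, \dots, \xi_\ell) \in \mathbb{R}^{\ell\wp}$ be a
$d$-vector function on $\mathbb{R}^d \times \mathbb{R}^{\ell\wp}$ such that for some constant $K > 0$ and all $x, \tilde x \in \mathbb{R}^d$,
$\xi, \tilde\xi \in \mathbb{R}^{\ell\wp}$, $i, j, l = 1, \dots, d$,

(2.7) $|B^{(i)}(x, \xi)| \le K$, $|B^{(i)}(x, \xi) - B^{(i)}(\tilde x, \tilde\xi)| \le K\big(|x - \tilde x| + \sum_{j=1}^{\ell} |\xi_j - \tilde\xi_j|^\kappa\big)$

and $\big|\frac{\partial B^{(i)}(x, \xi)}{\partial x_j}\big| \le K$, $\big|\frac{\partial^2 B^{(i)}(x, \xi)}{\partial x_j \partial x_l}\big| \le K$.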

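We will be interested in the central limit theorem type results as $\varepsilon \to 0$ for the
solution $X^\varepsilon(t) = X^\varepsilon_x(t)$ of the equation

(2.8) $\frac{dX^\varepsilon(t)}{dt} = \varepsilon B\big(X^\varepsilon(t), \xi(q_1(t)), \xi(q_2(t)), \dots, \xi(q_\ell(t))\big)$, $X^\varepsilon(0) = x$, $t \in [0, T/\varepsilon]$

where $q_1(t) < q_2(t) < \dots < q_\ell(t)$, $t > 0$ are increasing functions such that $q_j(t) = \alpha_j t$
for $j \le k \le \ell$ with $\alpha_1 < \alpha_2 < \dots < \alpha_k$ whereas the remaining $q_j$'s grow faster
in $t$. Namely, we assume similarly to [24] that for any $\gamma > 0$ and $k + 1 \le i \le \ell$,

(2.9) $\lim_{t \to \infty} \big(q_i(t + \gamma) - q_i(t)\big) = \infty$

and

(2.10) $\lim_{t \to \infty} \big(q_i(\gamma t) - q_{i-1}(t)\big) = \infty$.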

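Set

(2.11) $\bar{B}(x) = \int B(x, \xi_1, \dots, \xi_\ell)\, d\mu(\xi_1) \cdots d\mu(\xi_\ell)$.

We consider also the solution $\bar{X}^\varepsilon(t) = \bar{X}^\varepsilon_x(t)$ of the averaged equation

(2.12) $\frac{d\bar{X}^\varepsilon(t)}{dt} = \varepsilon \bar{B}(\bar{X}^\varepsilon(t))$, $\bar{X}^\varepsilon(0) = x$.

It will be convenient to denote $Z^\varepsilon(t) = X^\varepsilon(t/\varepsilon)$, $\bar{Z}(t) = \bar{X}^\varepsilon(t/\varepsilon)$ and to introduce
$Y^\varepsilon(t) = Y^\varepsilon_y(t)$ by

(2.13) $Y^\varepsilon_y(t) = y + \int_0^t B\big(\bar{Z}(s), \xi(q_1(s/\varepsilon)), \xi(q_2(s/\varepsilon)), \dots, \xi(q_\ell(s/\varepsilon))\big)\, ds$.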

2.2. Theorem. Suppose that (2.7), (2.9), (2.10) and Assumption 2.1 hold true.
Then the family of processes $G^\varepsilon(t) = \varepsilon^{-1/2}(Y^\varepsilon_z(t) - \bar{Z}_z(t))$, $t \in [0, T]$ converges
weakly as $\varepsilon \to 0$ to a Gaussian process $G^0(t)$, $t \in [0, T]$ having not necessarily
independent increments (see an example in [24]) with covariances of its
components $G^0(t) = (G^{0,1}(t), \dots, G^{0,d}(t))$ having the form
$E\, G^{0,l}(s) G^{0,m}(t) = \int_0^{\min(s,t)} A^{l,m}(u)\, du$ with the matrix function $\{A^{l,m}(u),\ 1 \le l, m \le d\}$ computed
in Section 4. Furthermore, the family of processes $Q^\varepsilon(t) = \varepsilon^{-1/2}(Z^\varepsilon(t) - \bar{Z}(t))$,
$t \in [0, T]$ converges weakly as $\varepsilon \to 0$ to a Gaussian process $Q^0(t)$, $t \in [0, T]$ which solves
the equation

(2.14) $Q^0(t) = G^0(t) + \int_0^t \nabla \bar{B}(\bar{Z}(s))\, Q^0(s)\, ds$.

In the discrete time setup (1.1) similar results hold true assuming that the $q_i$'s
take on integer values on integers, $\gamma$ in (2.9) is replaced by 1, $\alpha_i$ is replaced by $i$
for $i = 1, \dots, k$ and defining $Z^\varepsilon(t) = X^\varepsilon([t/\varepsilon])$ together with $Y^\varepsilon = Y^\varepsilon_y$ given by

(2.15) $Y^\varepsilon(t) = y + \int_0^t B\big(\bar{Z}(s), \xi(q_1([s/\varepsilon])), \xi(q_2([s/\varepsilon])), \dots, \xi(q_\ell([s/\varepsilon]))\big)\, ds$

while leaving all other definitions and assumptions the same as above.
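
The normalization $\varepsilon^{-1/2}$ in Theorem 2.2 can be checked by hand in a toy discrete time case. The sketch below rests on assumptions that are not part of the paper: $\ell = k = 2$, $q_1(n) = n$, $q_2(n) = 2n$, i.i.d. signs $\xi(n) = \pm 1$, and the hypothetical function $B(x, a, b) = ab$, which does not depend on $x$ and has $\bar{B} = 0$. With $\varepsilon = 1/N$ the normalized error at time 1 is $N^{-1/2} \sum_{n=1}^{N} \xi(n)\xi(2n)$, whose variance equals 1 by a direct computation since the summands are uncorrelated.

```python
import random


def normalized_error(N, rng):
    """eps^{-1/2} * (X(N) - Xbar(N)) with eps = 1/N for B(x, a, b) = a * b.

    Assumptions (illustrative only): xi is i.i.d. uniform on {-1, +1},
    q_1(n) = n, q_2(n) = 2n; the deterministic n = 0 term is left out.
    """
    xi = [rng.choice((-1.0, 1.0)) for _ in range(2 * N + 1)]
    total = sum(xi[n] * xi[2 * n] for n in range(1, N + 1))
    return total / N ** 0.5


def sample_variance(N=400, paths=400, seed=0):
    """Unbiased sample variance of the normalized error over independent paths."""
    rng = random.Random(seed)
    vals = [normalized_error(N, rng) for _ in range(paths)]
    mean = sum(vals) / paths
    return sum((v - mean) ** 2 for v in vals) / (paths - 1)


if __name__ == "__main__":
    print(sample_variance())
```

The printed value should be close to 1 for this toy model. It says nothing about the general covariance function $A^{l,m}$, which depends on $B$ and on the law of $\xi$.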

Observe that we work with $\bar{B}$ defined by (2.11) but in our circumstances the
central limit theorem type results from [24] imply also (1.4) and (1.7), at least, in
the $L^2$-sense while a nonconventional law of large numbers from [20] and (under
stationarity assumptions) pointwise nonconventional ergodic theorems from [1] and
[6] yield (1.4) and (1.7) also for the almost sure convergence. Note also that we need
the full strength of (2.6) only for one argument in Section 4 borrowed from [21] but
for a standard limit theorem not in the averaging setup, i.e. when $B(x, \xi_1, \dots, \xi_\ell) =
B(\xi_1, \dots, \xi_\ell)$ does not depend on $x$, it suffices to require only summability of the
expression in brackets in (2.6).

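An important point in the proof of the first part of Theorem 2.2 is to introduce
the representation

(2.16) $B(x, \xi) = \bar{B}(x) + B_1(x, \xi_1) + \dots + B_\ell(x, \xi_1, \dots, \xi_\ell)$

where $\xi = (\xi_1, \dots, \xi_\ell)$ and for $i < \ell$,

(2.17) $B_i(x, \xi_1, \dots, \xi_i) = \int B(x, \xi_1, \dots, \xi_\ell)\, d\mu(\xi_{i+1}) \cdots d\mu(\xi_\ell) - \int B(x, \xi_1, \dots, \xi_\ell)\, d\mu(\xi_i) \cdots d\mu(\xi_\ell)$

while

(2.18) $B_\ell(x, \xi_1, \dots, \xi_\ell) = B(x, \xi_1, \dots, \xi_\ell) - \int B(x, \xi_1, \dots, \xi_\ell)\, d\mu(\xi_\ell)$.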
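
The telescoping structure of (2.16)-(2.18) can be verified directly when $\mu$ has finite support. The following sketch is an illustration only: the three-point measure, the test function `B` and the choice $\ell = 3$ are hypothetical and not taken from the paper. It checks that $\bar{B} + B_1 + \dots + B_\ell$ recovers $B$ and that each $B_i$ integrates to zero in its last argument $\xi_i$.

```python
from itertools import product

STATES = (-1.0, 0.5, 2.0)   # hypothetical finite support of mu
MU = (0.2, 0.5, 0.3)        # hypothetical weights of mu (they sum to 1)
ELL = 3                     # number of xi-arguments of B


def B(x, xi):
    # Hypothetical test function of x and xi = (xi_1, xi_2, xi_3).
    return x * xi[0] + xi[0] * xi[1] ** 2 + xi[1] * xi[2] + xi[2]


def integrate_tail(x, head):
    """Integrate B(x, head, .) over the coordinates after `head` against mu."""
    free = ELL - len(head)
    total = 0.0
    for idx in product(range(len(STATES)), repeat=free):
        weight = 1.0
        for j in idx:
            weight *= MU[j]
        total += weight * B(x, tuple(head) + tuple(STATES[j] for j in idx))
    return total


def B_bar(x):
    """Full average of B, as in (2.11)."""
    return integrate_tail(x, ())


def B_part(i, x, xi):
    """B_i from (2.17)-(2.18) for i = 1, ..., ELL; it uses only xi_1, ..., xi_i."""
    return integrate_tail(x, xi[:i]) - integrate_tail(x, xi[:i - 1])


if __name__ == "__main__":
    x, xi = 0.7, (2.0, -1.0, 0.5)
    parts = [B_part(i, x, xi) for i in range(1, ELL + 1)]
    print(B(x, xi), B_bar(x) + sum(parts))
```

The centering of each $B_i$ in $\xi_i$ is the property that the martingale approximation of Section 3 relies on.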
Next, we introduce

(2.19) $Y^\varepsilon_i(t) = \int_0^{t/\alpha_i} B_i\big(\bar{Z}(s), \xi(q_1(s/\varepsilon)), \xi(q_2(s/\varepsilon)), \dots, \xi(q_i(s/\varepsilon))\big)\, ds$, $i = 1, \dots, k$,
$Y^\varepsilon_i(t) = \int_0^{t} B_i\big(\bar{Z}(s), \xi(q_1(s/\varepsilon)), \xi(q_2(s/\varepsilon)), \dots, \xi(q_i(s/\varepsilon))\big)\, ds$, $i = k+1, \dots, \ell$

and $Y^\varepsilon_0(t) = Y^\varepsilon_{0,y}(t) = y + \int_0^t \bar{B}(\bar{Z}(s))\, ds$. Thus $Y^\varepsilon$ from (2.13) has the
representation

(2.20) $Y^\varepsilon_y(t) = Y^\varepsilon_0(t) + \sum_{i=1}^{k} Y^\varepsilon_i(\alpha_i t) + \sum_{i=k+1}^{\ell} Y^\varepsilon_i(t)$.

We consider also $X^\varepsilon_0(t) = \bar{X}^\varepsilon(t)$, $X^\varepsilon_i(t) = X^\varepsilon_{i,x}(t) = x +
\varepsilon \int_0^t B_i\big(X^\varepsilon_i(s), \xi(q_1(s)), \dots, \xi(q_i(s))\big)\, ds$ and $Z^\varepsilon_i(t) = X^\varepsilon_i(t/\varepsilon)$ for all $i \ge 0$.
For $i \ge 1$ set also

(2.21) $G^\varepsilon_i(t) = \varepsilon^{-1/2} Y^\varepsilon_i(t)$ and $Q^\varepsilon_i(t) = \varepsilon^{-1/2} Z^\varepsilon_i(t)$.

Relying on martingale approximations (which also can be done employing
mixingales from [25] and [27]) we will show that any linear combination $\sum_{i=1}^{\ell} \lambda_i G^\varepsilon_i$
converges weakly as $\varepsilon \to 0$ to a Gaussian process $\sum_{i=1}^{\ell} \lambda_i G^0_i$. It turns out that in the
continuous time case each $G^\varepsilon_i$, $i = k+1, \dots, \ell$ converges weakly as $\varepsilon \to 0$ to zero,
and so the processes $Y^\varepsilon_i$, $i > k$ do not play any role in the limit. It follows that
$G^\varepsilon$ converges weakly to a Gaussian process $G^0$ such that $G^0(t) = \sum_{i=1}^{k} G^0_i(\alpha_i t)$.
On the other hand, in the discrete time case each $G^\varepsilon_i$, $i > k$ cannot be disregarded,
in general, and it converges weakly as $\varepsilon \to 0$ to a Gaussian process $G^0_i$ which is
independent of any other $G^0_j$. The above difference between discrete and continuous
time cases is due to the different natural forms of the assumption (2.9) in these two
cases. These arguments yield the first part of Theorem 2.2 while its second part
concerning convergence of $Q^\varepsilon$ as $\varepsilon \to 0$ is proved via some Taylor expansion and
approximation arguments.

In order to clarify the role of the coefficients $\eta_{p,\kappa}$ and $\zeta_q$ we compare them with
the more familiar mixing and approximation coefficients defined via a two parameter
family of σ-algebras $\mathcal{G}_{s,t} \subset \mathcal{F}$, $-\infty \le s \le t \le \infty$ by

(2.22) $\varpi_p(n) = \sup_{s \ge 0,\, g} \big\{ \| E(g \,|\, \mathcal{G}_{-\infty,s}) - E_P\, g \|_p : g \text{ is } \mathcal{G}_{s+n,\infty}\text{-measurable and } |g| \le 1 \big\}$

and

(2.23) $\beta_q(n) = \sup_{t \ge 0} \| E(\xi(t) \,|\, \mathcal{G}_{t-n,t+n}) - \xi(t) \|_q$,

respectively, where $\mathcal{G}_{s,t} \subset \mathcal{G}_{s',t'}$ if $s' \le s$ and $t' \ge t$. Then setting $\mathcal{F}_l = \mathcal{G}_{-\infty,l}$ we
obtain by the contraction property of conditional expectations that

(2.24) $\beta_q(n) \ge \sup_{t \ge 0} \| E(\xi(t) \,|\, \mathcal{G}_{t-n,t+n}) - \xi(t) + \xi(t) - E(\xi(t) \,|\, \mathcal{G}_{-\infty,[t]+n+1}) \|_q \ge \zeta_q(n+1) - \beta_q(n)$, i.e. $\beta_q(n) \ge \frac{1}{2}\zeta_q(n+1)$.

Furthermore,

$\| g(\xi(n+t), \xi(n+t+s)) - g\big(E(\xi(n+t) \,|\, \mathcal{G}_{n+t-[n/2],\, n+t+[n/2]}),\ E(\xi(n+t+s) \,|\, \mathcal{G}_{n+t+s-[n/2],\, n+t+s+[n/2]})\big) \|_p \le 2|g|_\kappa\, \beta^\kappa_{p\kappa}([n/2])$,

and so

(2.25) $\eta_{p,\kappa}(n) \le \big(\varpi_p([n/2]) + 2\beta^\kappa_{p\kappa}([n/2])\big)|g|_\kappa$.

Thus, appropriate conditions on decay of coefficients $\varpi_p$ and $\beta_q$ as in [24] yield
corresponding conditions on $\eta_{p,\kappa}$ and $\zeta_q$. The other direction does not hold true
but still it turns out that most of the technique from [24] can be employed in our
circumstances, as well.

The conditions of Theorem 2.2 hold true for many important stochastic processes.
In the continuous time case they are satisfied when, for instance, $\xi(t) = f(\Xi(t))$
where $\Xi(t)$ is either an irreducible continuous time finite state Markov chain or
a nondegenerate diffusion process on a compact manifold while $f$ is a Hölder
continuous vector function. In the discrete time case we can take, for instance,
$\xi(n) = f(\Xi(n))$ with $\Xi(n)$ being a Markov chain satisfying the Doeblin condition
(see, for instance, [16], pp. 367-368). In all these examples $\eta_{p,\kappa}(n)$ and $\zeta_q(n)$ decay
in $n$ exponentially fast while (2.6) requires much less. In fact, in both cases $\xi(t)$
may depend on whole paths of a Markov process $\Xi$ assuming only certain weak
dependence on their tails.

Important classes of processes satisfying our conditions come from dynamical
systems. In Section 6 we take $\xi(t) = \xi(t, z) = g(F^t z)$ where $F^t$ is a $C^2$ Anosov flow
(see [23]) on a compact manifold $M$ whose stable and unstable foliations are jointly
nonintegrable and $g$ is a Hölder continuous $\wp$-vector function on $M$. It turns out
that if we take the initial point $z$ on an element $S$ of a Markov family (see Section
6) introduced in [8], distributed there at random according to a probability measure
equivalent to the volume on $S$, then Assumption 2.1 can be verified. This does not
yield though a desirable limit theorem where the initial point is taken at random
on the whole manifold $M$ distributed according to the Sinai-Ruelle-Bowen (SRB)
measure (or the normalized Riemannian volume). We observe that a suspension
representation of Anosov flows employed in [17] to derive limit theorems in the
conventional averaging setup does not work in our situation because $F^{q_i(t)} x$,
$i = 1, \dots, \ell$ arrive at the ceiling of the suspension at different times for different $i$'s.

In the discrete time case there are several important classes of dynamical systems
where our conditions can be verified. First, for transformations where symbolic
representations via Markov partitions are available (Axiom A diffeomorphisms (see
[3]) and expanding endomorphisms, some one-dimensional maps, e.g. the Gauss map
(see [15]), etc.) we can rely on standard mixing and approximation assumptions
based on two parameter families of σ-algebras as in (2.22) and (2.23). On the
other hand, for many transformations Markov partitions are not available but still
it is possible to construct a one parameter increasing or decreasing filtration of
σ-algebras so that our conditions can be verified. For some classes of noninvertible
transformations $F$ it is possible to choose an appropriate initial σ-algebra $\mathcal{F}_0$ such
that $F^{-1}\mathcal{F}_0 \subset \mathcal{F}_0$ and then to define a decreasing filtration $\mathcal{F}_i = F^{-i}\mathcal{F}_0$ (see [25]
and [13]). Passing to the natural extension as in Remark 3.12 of [13] we can turn to
an increasing filtration and verify our conditions. On the other hand, our results
can be derived under appropriate conditions with respect to decreasing families
of σ-algebras. Namely, let $\mathcal{F} \supset \mathcal{F}_0 \supset \mathcal{F}_1 \supset \mathcal{F}_2 \supset \cdots$ and define mixing and
approximation coefficients by

(2.26) $\eta_{p,\kappa,s}(n) = \sup_{t \ge s} \big\{ \| E(g(\xi(t), \xi(t-s)) \,|\, \mathcal{F}_{[t]+n}) - E_P\, g(\xi(t), \xi(t-s)) \|_p : g = g(\xi, \tilde\xi),\ |g|_\kappa \le 1 \big\}$, $\eta_{p,\kappa}(n) = \eta_{p,\kappa,0}(n)$

and

(2.27) $\zeta_q(n) = \sup_{t \ge n} \| E(\xi(t) \,|\, \mathcal{F}_{[t]-n}) - \xi(t) \|_q$.

Then under Assumption 2.1 we can rely on estimates of Section 3 below and in
place of martingales there arrive at reverse martingales and use a limit theorem
for the latter.

2.3. Remark. If $\bar{B} \equiv 0$ then according to Theorem 2.2 the process $X^\varepsilon(t)$ is very
close to its initial point on the time interval of order $1/\varepsilon$. Thus, in order to see
fluctuations of order 1 it makes sense to consider longer time and to deal with
$V^\varepsilon(t) = X^\varepsilon(t/\varepsilon^2)$. Under the stronger condition $\int B(x, \xi_1, \dots, \xi_\ell)\, d\mu(\xi_\ell) \equiv 0$ it is not
difficult to mimic the proofs in [22] and [4] relying on the technique of Sections 3
and 4 below in order to obtain that $V^\varepsilon(t)$, $t \in [0, T]$ converges weakly as $\varepsilon \to 0$
to a diffusion process with parameters obtained in the same way as in [22] and
[3]. It is not clear whether, in general, this result still holds true assuming only
that $\bar{B} \equiv 0$. Though most of the required estimates still go through in the latter
case, a convergence of $V^\varepsilon$ to a Markov process seems to be problematic in a general
nonconventional averaging setup.

3. Estimates and martingale approximation 



The proof of Theorem 2.2 will employ a modification of the machinery developed
in [24]. First, we have to study the asymptotical behavior as $\varepsilon \to 0$ of

(3.1) $G^\varepsilon_i(t) = \sqrt{\varepsilon} \int_0^{\tau_i(t)/\varepsilon} B_i\big(\bar{Z}(\varepsilon s), \xi(q_1(s)), \dots, \xi(q_i(s))\big)\, ds$

which is obtained from the definition (2.21) by the change of variables $s \to s/\varepsilon$ and
where $\tau_i(t) = t/\alpha_i$ for $i = 1, \dots, k$ and $\tau_i(t) = t$ for $i = k+1, \dots, \ell$. Observe that if
$\frac{1}{N+1} < \varepsilon \le \frac{1}{N}$ and $N \ge 1$ then by (2.7),

(3.2) \Gm-Gy N (t)\< 2Kl11 



N 

and so it suffices to study the asymptotical behavior of $G^{1/N}_i$ as $N \to \infty$. Set

(3.3) $I_{i,N}(n) = \int_n^{n+1} B_i\big(\bar{Z}(s/N), \xi(q_1(s)), \dots, \xi(q_i(s))\big)\, ds$.

In view of (2.7) the asymptotical behavior of $G^{1/N}_i$ as $N \to \infty$ is the same as of
$N^{-1/2} S_{i,N}(t)$ where

(3.4) $S_{i,N}(t) = \sum_{n=0}^{[N\tau_i(t)]} I_{i,N}(n)$.

There are two obstructions for applying directly the results of [24] to the sum (3.4).
First, unlike [24] the integrand in (3.3) depends on the "slow time" $s/N$. Secondly,
our mixing and approximation coefficients look different from the corresponding
coefficients in [24]. Still, it turns out that these obstructions can be dealt with and
after minor modifications the method of [24] starts working in our situation, as well.
Namely, the dependence on the "slow time" being deterministic will not prevent us
from making estimates similar to [24] while dependence of $I_{i,N}$ on $N$ will just require
us to deal with martingale arrays, which creates no problems as long as we obtain
appropriate limits of variances and covariances. Concerning the second obstruction
we observe that one half of the approximation estimate from [24] is contained in the
coefficient $\zeta_q$, while another half is hidden in the coefficient $\eta_{p,\kappa}$ which also suffices
for required mixing estimates.

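We explain next more precisely why estimates similar to [24] hold true in our
circumstances, as well. Let $f(\psi, \xi, \tilde\xi)$ be a function on $\mathbb{R}^{\mathfrak{d}} \times \mathbb{R}^\wp \times \mathbb{R}^\wp$ such that for
any $\psi, \psi' \in \mathbb{R}^{\mathfrak{d}}$ and $\xi, \tilde\xi, \xi', \tilde\xi' \in \mathbb{R}^\wp$,

(3.5) $|f(\psi, \xi, \tilde\xi) - f(\psi', \xi', \tilde\xi')| \le C\big(|\psi - \psi'|^\kappa + |\xi - \xi'|^\kappa + |\tilde\xi - \tilde\xi'|^\kappa\big)$ and $|f(\psi, \xi, \tilde\xi)| \le C$.

Then setting $g(\psi) = E_P f(\psi, \xi(0), \xi(v))$ we obtain from (2.1) and (2.3) that for all
$u, v \ge 0$ and $n \in \mathbb{N}$,

(3.6) $\| E(f(\psi, \xi(n+u), \xi(n+u+v)) \,|\, \mathcal{F}_{[u]}) - g(\psi) \|_p \le C \eta_{p,\kappa,v}(n)$.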

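Let $h(\psi, \omega) = E(f(\psi, \xi(n+u), \xi(n+u+v)) \,|\, \mathcal{F}_{[u]}) - g(\psi)$. Then by (3.5) we can
choose a version of $h(\psi, \omega)$ such that with probability one simultaneously for all
$\psi, \psi' \in \mathbb{R}^{\mathfrak{d}}$,

(3.7) $|h(\psi, \omega) - h(\psi', \omega)| \le 2C|\psi - \psi'|^\kappa$.

Since, in addition, $\|h(\psi, \omega)\|_p \le C\eta_{p,\kappa,v}(n)$ by (3.6) for all $\psi \in \mathbb{R}^{\mathfrak{d}}$ we obtain by
Theorem 3.4 from [24] that for any random $\mathfrak{d}$-vector $\Psi = \Psi(\omega)$,

(3.8) $\|h(\Psi(\omega), \omega)\|_a \le cC\big(\eta_{p,\kappa,v}(n)\big)^{1 - \frac{\mathfrak{d}}{p\theta}} (1 + \|\Psi\|_m)$

where $\frac{\mathfrak{d}}{p} < \theta < \kappa$, $\frac{1}{a} \ge \frac{1}{p} + \frac{1}{m}$ and $c = c(\mathfrak{d}, p, \kappa, \theta) > 0$ depends only on parameters
in brackets. Since

(3.9) $h(\Psi(\omega), \omega) + g(\Psi(\omega)) = E(f(\Psi, \xi(n+u), \xi(n+u+v)) \,|\, \mathcal{F}_{[u]})(\omega)$ a.s.

provided $\Psi$ is $\mathcal{F}_{[u]}$-measurable we obtain from (3.7)-(3.9) together with the Hölder
inequality (cf. Corollary 3.6 in [21]) that,

(3.10) $\| E(f(\Psi, \xi(n+u), \xi(n+u+v)) \,|\, \mathcal{F}_{[u]}) - g(\Psi) \|_a \le cC\big(\eta_{p,\kappa,v}(n)\big)^{1 - \frac{\mathfrak{d}}{p\theta}} (1 + \|\Psi\|_m) + 2C\|\Psi - E(\Psi \,|\, \mathcal{F}_{[u]})\|^\delta_q$

provided $\frac{1}{a} \ge \frac{1}{p} + \frac{2}{m} + \frac{\delta}{q}$.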

We apply the above estimates in two cases. First, when /(?/>, £, £) = 

f(ip,0 = with ^ = (&,..., e K^- 1 )^, C = 6 g rp, 

n - - <ft-i(t))/2], u = ft(t) - nand*= 

In the second case f(ip, £, £) = B^x, £i, €i)Bj(y, fj, fj) with ^ = 
(&,... ,6-1, G R«+*-*>p, £ = g M p , n - [(mm( qi (t), qj (s)) - 

max(gi_i(<), (s)))/2] when n > 0, u = min(gj(t), qj(s)) — n and $ = 
... > ^(5i-i(i)),C(g , i(s)), ...,^(gj-i(s))). The estimates for the first case are 
used for martingale approximations while the second case emerges when computing 
covariances. 

Since f Bi(x,£i, = we obtain by (|3.10p the estimate 

(3.11) ||25(iM£(3i(t)),...,£^ 

for some C > independent of i where n = n.;(i) = [(qi(i) — gj_i(t))/2]. Next, for 
any a: g R e , £i, € K e and r = 1, 2, ... set 

fl i , r (a;,ei,...^ i _i^(t)) = £;(fl(a!,ei,...,6_i^(t))|^ w+r ) 
and frW =-E(C(*)|J 7 [t]+r)- 
Then by (2.4) and (2.7) together with the Hölder inequality,

(3.12) \\B t (x^u.:,^-um) ~ B hr (x^u...,^-um)\\ q 
<2\\B l (x^ ll ...,^ li at))^B l (x^ ll ...,C^ 1 ,E(m\^[t] + r))\\ q 

< 2Kd\\m - E(t(t))\?lt]+r)\% < 2Kd( s q (r). 
Moreover, similarly to Lemma 3.12 in [24] we obtain that 

(3.13) ||Si(l,f(«ft(t)),...,€(«i(0))-fli,r(s,^(ft(t)),...^r(«i-l(t)))L 

provided i > | + ^ + | and <5 < min(/t, 1 — ^) where c = c(<5, a,p, q) > depends 
only on the parameters in brackets. Set 

b l f{x,y;s,t) = E{Bf ) (i,f(giW),.,feWPj m) (^(?i(i)),..,%(i)))) 

where, recall, is the i-th component of the d- vector B t . Now, by (|2.7p . p. lip 
and |3T3l) . 

(3.14) ^{x^s^KC^^n)) 1 -* + {C q {n)) s ) 

where C > does not depend on s, t > and n = riij(s, t) — max(fty-(s, i), fiji{t, s)) 
with hij(s,t) = [|min(gj(s) - g^(t), ^(s) - %_i(s))]. 
Now, set 

(3.15) /^(n) = /^^^(^(s/JVj.erfeiWJ.-^rfe-iCs)))^, S,w,r(t) = 

^v[iVn(t)] I. Nr ( n ^ R i r {m) = YaLttl+I E{I^ N A l )\^rn+r), A,iV,r(m) = 
Ii,N,r( m ) + RiA m ) - Rt,r(m ~ 1) and M^iy.r (*) = W1 A,iV,r(H). 

In view of (3.11) applied with $a = 2$ we see that the series for $R_{i,r}(m)$ converges in
$L^2$, $D_{i,N,r}(m)$ is $\mathcal{F}_{m+r}$-measurable and since $E(D_{i,N,r}(m) \,|\, \mathcal{F}_{m-1+r}) = 0$ we obtain
that $\{D_{i,N,r}(m), \mathcal{F}_{m+r}\}_{0 \le m \le [N\tau_i(T)]}$ is a martingale differences array. Next, we
proceed similarly to Sections 5 and 7 of [24] observing that the limiting behaviour
of $N^{-1/2} S_{i,N,r}(t)$ as $N \to \infty$ is the same as of $N^{-1/2} M_{i,N,r}(t)$, then dealing with
the latter by means of martingale limit theorems and, finally, employing the
representation

(3.16) $S_{i,N}(t) = S_{i,N,1}(t) + \sum_{r=1}^{\infty} \big(S_{i,N,2^r}(t) - S_{i,N,2^{r-1}}(t)\big)$.

In order to complete this program it remains only to compute limiting covariances
as in Section 4 of [24] taking care also of the slow time $s/N$ entering (3.3) and (3.15).

4. Limiting covariances 

In this section we show the existence and compute the limit as N — > oo of the 
expression 

(4.1) EiGl^Gl^t)) = s f' (S/£) r it/£) b l f(Z(su),Z(ev);u,v)dudv. 

Jo Jo 

We start with showing that there exists a constant C > such that for alH > s > 
0, I = l,...,d, N > 1 and i = 1, ...,£, 

(4.2) sup E\G]{ N (t) - G][ N {s)\ 2 < C(t-s). 

s>0 
In order to obtain (4.2) we note that by (2.9) and (2.10) for $t \ge s$,

(4.3) qi(t) - qi(s) >Oi(t— s) and q^t) - g»j_i(t) > Oi-it when i = 2, k 
and for any 7 > there exists t 7 such that for all t > t 7 and i = A; + 1, 

(4.4) ?i( f )-%( s ) > (i~s)+ 7 - 1 andg,-(*)-ft_i(i) >i + 7 _1 - 

Now g2| follows from ([2~6]t . (^14|) . flO}, gSJ and (03}. Observe, that by fl3J2 
and imj if < e < i then 

|i5G?>)G£ m (t) - £G*f( S )G^ (t)| < 



4KdCVT 



and so it suffices to study (j4.1[) as e = i and iV — > 00. 

Next, we claim that if i > j and i > k then the limit in (|4.1j) as = = — > 00 
exists and equals zero. Indeed, in this case for any small 7 > with 7T < s, 

(4-5) \EG\f ( 8 )G)S (t)\<h +h 

where by (|4.2I) . 
(4.6) 

h = l< w (7r)C(i)| < (i?(G.:f (HT)Y) 1/2 (E{ G )^{t)Y) 1/2 < cVWt 

and by (|3Jg]l . 

(4.7) J a = |15(G}f (.) - Gjf (7T))G^(t)| = 1 j^<«0 
b^ m (Z{u/N),Z(v/N);u,v)dv < % f£„du tf (m Pij (u,v)dv 

where 

(4.8) Pij(u,v) = (?7p iK (n.y(M,w))) 1_ * + (C g (n»j (u, v))) 5 

with nij(s,t) defined after (|3~T4|) . It follows from p^]) . (|2J)) . (|2~T0|) and (gSJ that 
for any 7 > there exists iV 7 such that whenever N > iV 7 and v € [0,T-/V] (cf. 
Proposition 4.5 in [24]). 

Pij(u,v)du < 7, 



'yTN 

and so ^2 < CT7. Since 7 > is arbitrary this together with (j4.5[) and (|4.6p yields 
that for all Z, m = 1, d, i > k and j < i, 

(4.9) lim EG 1 i f(s)G]£(t)=0. 

Next, we claim that when i > k then also for all I, m — 1, d, 

(4.10) hm EGlf(s)Gl£(t)=0. 

TV— voo 1 ' 

Indeed, by ff37T4]) and |@~HJ} for i > s, 

(4.11) (s)G 1 £(t)\ <- du Pii (u,v)dv = h + h 



where 



Is = — / <iit / pu(u,v)dv and Zi = — / g?w / pa(u,v)dv 



N In ./,, ' " V ' iV 



AT 
Now



(4.12) 7 3 = jj J° N du J u tl+7 pa (u, v)dv + % J^ N du f*_^ p u (u, v)dv 
for some C > where by (|2.6p and (|2.10l) for any 7 > 0, 



(4.13) /3 7 (M) = sup / pu(u,v)dv < 00 and lim /3 7 (M) = 0. 

u>M J«+ 7 M->oo 

Next, 

(4.14) h = i J Q S J s ^ +7 w)dw 

+17 Jo ^ du /°« < CS7 + Cs^ s (AT) . 

Finally, (jHE| follows from ([iTTT]l - (|4lT3]l letting, first, AT -> 00 and then 7^0. 

In order to compute the limit as | = A" — >• 00 of (|4.ip for i, j = 1, 2, fc we 
recall an argument of Lemma 3.1 from |21) which yields that if uniformly in a > 
and x, y from a compact set the limit 

-, Acr+sN)/ai r(tr+sN)/aj 

(4.15) Jim-/ du b^(x,y;u,v)dudv^sD l l ' m (x,y) 

)rm of the right ham 
(|4.1j) exists, as well, and it has the form 



exists and has the form of the right hand side with a continuous -D''™ then the limit 



/>min(s,t) 

(4.16) lim E(G 1 i f(s)G 1 £(t))= / D^(Z(u), Z(u))du. 

JV-s-oo Jo 

Namely, set M = M(N) = [N 2/3 ] and let s t = jg, 1 = 0, 1, M - 1. Assume 
also that s < t. Let 

An = U^o^JV,* with Ajv, t = {(«,«) : sJV < u,w < (s 4 + jj)N} 

and 5^ = {(<u, v) : <u < sN, <v < tN} \ A N . Then 

(4-17) EG]> N {s)G 1 l"{t)=h + h 



where 



and 



1 



h = Ni] I h ^ Z{ - u / N ^ Z ( v / N ^ U A> v/j)dudv 



J 8 = TF~ I bl iT( Z { u / N )i Z ( v / N )i u /h v/j)dudv. 



Am 



Nij 

Now, by (j3~T4| and (|L8|) . 

(4.18) i/ 5 i < ^(EfdoV; i/e /i;; £ +ir)/£ (^(«M,«/a J ) 

+Pji(u/aj ,v I ai))dudv + J Q s ' /e f* l J^ p l3 (u/ai, v/aj)dudv). 

Observe that by the definition of riij(u,v) after (|3.14|) we can write for i, j = 
1, fc, 

(4.19) $\rho_{ij}(u/\alpha_i, v/\alpha_j) = \zeta(|u - v|)$

where $\zeta \ge 0$ satisfies $\int_0^\infty w\zeta(w)\, dw < \infty$. Integrating by parts we obtain for any
$V \ge U \ge 0$,

(4.20) 

cU rV f-U />oo f-U f-oo 

du I ((v — u)dv < du ((w)dw = / rQ(r)dr < / r(,{r)dr. 

io Ju Jo Ju-u Jo Jo 

This together with ([H]), (j4~T8| and (j4~19l) gives by the choice of M = M(N) that 

M 

(4.21) \h\<C— ^OasiV^oo 

NctiCtj 

for some C > independent of M and N. 
Next, 

M-l 

(4-22) h = 77 V Jm,n{l) + h 

Ma iaj ^ 

where 

Mr ii it 

Jmm(l) = -j- / b^(Z(s L ), Z(s L ); -, —)dudv 

iV J Sl <u,v<{s b + -fa)N a i a j 

and by jUTJl and the choice of M = M(N), 

(4.23) |J 7 | < Cs 3 NM~ 2 — > as TV — > oo 

where C > does not depend on s,iV and M. By (|4.15j) we obtain that 

(4.24) | Jm.jvW ~ sa.a^'f l (^(«J> £(*t))l -> as iV -> oo, 
and so 

(4.25) |7 6 - ^ D\f(Z{u),Z{u))du\ — !• as TV — !• oo 

completing the proof of (14. 16[) . 

In order to describe D l i 'J n (x, y), i,j < k consider all indices 1 < i[ < i' 2 < ... < 
i' H . = i and 1 < j[ < j' 2 < ... < j' H . = j such that there exist < p\ < ... < p Hj = 1 
satisfying a^pi, a^pi G {cti, au} for all I = 1, . Define 

(4.26) ajf (x,j/; Sl) ..., Sli ,) = /sf ) (x^ 1 ,...,^)5 i (y,a,...,| 7 -) 

7 0{-ii,...,-ij. . },l<i r <i .},l<j<<.j d ^3<^ 

Then in the same way as in the proof of Lemma 4.4 from [51] (see also Section 7 

there) we obtain relying on (j2~6l) . (|3T0)) and (j3T4l) that 

(4.27) 

lim b l ^(x,y;Nu N , Nv N ) = a^(x,y; p x w, p 2 w, ...,p L w). 

iv— »oo, a.iNitN—ajNvN=w J J 

This is the only place where we need Assumption 2.1 for $\eta_{p,\kappa,s}$ with $s > 0$. It follows
similarly to Section 7 of [24] that the limit (4.15) exists and it can be written in
the form

i r°° 



(4.28) D t ' J m (x,y) = / a?™(x, y; ptw, p 2 w, p Hj w)dw. 

Collecting the results of Sections 3 and 4 together we conclude that each $G^\varepsilon_i$,
$i = 1, \dots, k$ converges weakly as $\varepsilon \to 0$ to the corresponding Gaussian process $G^0_i$ having
independent increments while the processes $G^\varepsilon_i$, $i > k$ converge weakly as $\varepsilon \to 0$ to
zero (in the continuous time case we are dealing with now). Moreover, the processes
$G^\varepsilon$ converge weakly as $\varepsilon \to 0$ to a Gaussian process $G^0$ (with not necessarily
independent increments as an example in [24] shows) having the representation

(4.29) $G^0(t) = \sum_{i=1}^{k} G^0_i(\alpha_i t)$.

Furthermore, the covariances of different components $G^0_i(s) = (G^{0,1}_i(s), \dots, G^{0,d}_i(s))$ of
these processes are described in view of the above by

(4.30) $E\, G^{0,l}_i(s) G^{0,m}_j(t) = \int_0^{\min(s,t)} D^{l,m}_{i,j}(\bar{Z}(u), \bar{Z}(u))\, du$,

and so by (4.29),

(4.31) $E\, G^{0,l}(s) G^{0,m}(t) = \int_0^{\min(s,t)} A^{l,m}(u)\, du$

where 

A l > m (u) = £ D\f(Z(iu),ZUu)). 

l<i,i<fc 

5. Gaussian approximation of the slow motion and discrete time case 

In order to complete the proof of Theorem 2.2 we proceed similarly to [3T]. First,
we consider the process $H^\varepsilon(t)$ which solves the linear equation

(5.1)  $$H^\varepsilon(t) = G^\varepsilon(t) + \int_0^t \nabla \bar B(\bar Z(s))\,H^\varepsilon(s)\,ds.$$

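By (2.7), for some $C > 0$ independent of $t$ and $\varepsilon$,

$$|H^\varepsilon(t)| \le |G^\varepsilon(t)| + C \int_0^t |H^\varepsilon(s)|\,ds.$$

Then

$$\big| |H^\varepsilon(t)| - |G^\varepsilon(t)| \big| \le C \int_0^t |G^\varepsilon(s)|\,ds + C \int_0^t \big| |H^\varepsilon(s)| - |G^\varepsilon(s)| \big|\,ds$$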

and by Gronwall's inequality

(5.2)  $$|H^\varepsilon(t)| \le |G^\varepsilon(t)| + C e^{Ct} \int_0^t |G^\varepsilon(s)|\,ds.$$
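As a numerical sanity check of the Gronwall bound (5.2), the following minimal Python sketch discretizes a scalar analogue of the linear equation (5.1) and compares its solution with the right hand side of (5.2). The coefficient `b`, the path `g` and all function names are illustrative assumptions which do not come from the paper: `b` stands in for the gradient of the averaged vector field along the averaged motion, with $|b| \le C = 1$, and `g` stands in for one fixed path of $G^\varepsilon$.

```python
import math


def solve_linear_volterra(g, b, t_max, n_steps):
    """Explicit scheme for H(t) = g(t) + int_0^t b(s) H(s) ds on a uniform grid."""
    h = t_max / n_steps
    ts = [i * h for i in range(n_steps + 1)]
    hs = []
    integral = 0.0  # left Riemann sum of b(s) H(s) over [0, t)
    for t in ts:
        value = g(t) + integral
        hs.append(value)
        integral += h * b(t) * value
    return ts, hs


def gronwall_bound(g, c, ts):
    """Values |g(t)| + c * exp(c t) * int_0^t |g(s)| ds, with a left Riemann sum."""
    bounds = []
    integral = 0.0
    for i, t in enumerate(ts):
        bounds.append(abs(g(t)) + c * math.exp(c * t) * integral)
        if i + 1 < len(ts):
            integral += (ts[i + 1] - t) * abs(g(t))
    return bounds


# Hypothetical inputs: an oscillating path and a coefficient bounded by C = 1.
path = lambda t: math.sin(5.0 * t)
coeff = lambda t: math.cos(t)

ts, hs = solve_linear_volterra(path, coeff, t_max=2.0, n_steps=400)
bounds = gronwall_bound(path, 1.0, ts)
violations = sum(1 for v, bd in zip(hs, bounds) if abs(v) > bd + 1e-9)
print("grid points violating the bound:", violations)
```

The discrete scheme satisfies a discrete Gronwall inequality with the same left Riemann sums, so the comparison is not affected by discretization error in this toy setting.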

It follows from Section 3 that the family of processes
$\{G^\varepsilon(t),\ t \in [0,T]\}$ is tight, which together with (5.2) implies
that the family of processes $\{H^\varepsilon(t),\ t \in [0,T]\}$, as well as the
family of pairs $V^\varepsilon = \{G^\varepsilon, H^\varepsilon\}$, is tight.

It follows that any weak limit $V^0 = \{G^0, H^0\}$ of $V^\varepsilon$ as
$\varepsilon \to 0$ must satisfy the equation

(5.3)  $$H^0(t) = G^0(t) + \int_0^t \nabla \bar B(\bar Z(s))\,H^0(s)\,ds$$

which has a unique solution. Moreover, its solution $H^0$ is a Gaussian process.
Indeed, the equation (5.3) can be solved by successive approximations starting
from $G^0$ so that on each step we will get a Gaussian process (in view of linearity)
and the limiting process will be Gaussian, as well. Moreover, $H^0$ depends linearly
on $G^0$ having an integral representation of the form

(5.4)  $$H^0(t) = G^0(t) + \int_0^t K(t,s)\,G^0(s)\,ds$$

with a differentiable kernel $K$ (Green's function). The latter follows considering
an operator $A$ given by

$$Af(t) = \int_0^t \nabla \bar B(\bar Z(s))\,f(s)\,ds$$

which has the supremum norm less than 1 if $t \in [0, \Delta]$ for $\Delta$ small
enough, and so we can write

$$H^0 = (I - A)^{-1} G^0 = G^0 + \sum_{n=1}^{\infty} A^n G^0.$$

In view of the form of the integral operator $A$ above this representation yields
(5.4) on the interval $[0, \Delta]$ and then employing the same argument successively
to time intervals $[\Delta, 2\Delta]$, $[2\Delta, 3\Delta]$, ... we extend the
representation (5.4) to any $t$.
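The successive approximations and the series $G^0 + \sum_n A^n G^0$ can be illustrated numerically. The sketch below is a scalar toy version under stated assumptions: the coefficient standing in for the gradient of the averaged vector field is taken identically equal to 1 and the input path is taken identically equal to 1, so that the exact solution of the integral equation is $e^t$. The function names are hypothetical and the integral operator is replaced by a left Riemann sum.

```python
import math


def apply_A(f_vals, b_vals, h):
    """Left Riemann sum version of (A f)(t_k) = int_0^{t_k} b(s) f(s) ds."""
    out, acc = [], 0.0
    for f, b in zip(f_vals, b_vals):
        out.append(acc)
        acc += h * b * f
    return out


def neumann_solution(g_vals, b_vals, h, n_terms):
    """Partial sum G + A G + ... + A^{n_terms} G of the series for (I - A)^{-1} G."""
    total = list(g_vals)
    term = list(g_vals)
    for _ in range(n_terms):
        term = apply_A(term, b_vals, h)  # next power of A applied to G
        total = [x + y for x, y in zip(total, term)]
    return total


n_steps, t_max = 200, 1.0
h = t_max / n_steps
g_vals = [1.0] * (n_steps + 1)  # assumed input path G(t) = 1
b_vals = [1.0] * (n_steps + 1)  # assumed coefficient b(s) = 1
approx = neumann_solution(g_vals, b_vals, h, n_terms=40)
print(approx[-1], math.exp(t_max))
```

In this toy case the $n$-th term of the series at time $t$ is close to $t^n/n!$, which is why a moderate number of terms already reproduces the solution of the discretized equation.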
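Observe that

(5.5)  $$Q^\varepsilon(t) = \varepsilon^{-1/2} \int_0^t \big( B(Z^\varepsilon_x(s), \xi(q_1(s/\varepsilon)), \dots, \xi(q_\ell(s/\varepsilon))) - \bar B(\bar Z_x(s)) \big)\,ds$$
$$= G^\varepsilon(t) + \int_0^t \nabla_x B(\bar Z_x(s), \xi(q_1(s/\varepsilon)), \dots, \xi(q_\ell(s/\varepsilon)))\,Q^\varepsilon(s)\,ds + \int_0^t J_1^\varepsilon(s)\,ds$$

where

$$J_1^\varepsilon(s) = \varepsilon^{-1/2} \big( B(\bar Z_x(s) + \sqrt{\varepsilon}\,Q^\varepsilon(s), \xi(q_1(s/\varepsilon)), \dots, \xi(q_\ell(s/\varepsilon))) - B(\bar Z_x(s), \xi(q_1(s/\varepsilon)), \dots, \xi(q_\ell(s/\varepsilon)))$$
$$- \nabla_x B(\bar Z_x(s), \xi(q_1(s/\varepsilon)), \dots, \xi(q_\ell(s/\varepsilon)))\,\sqrt{\varepsilon}\,Q^\varepsilon(s) \big).$$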

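If $H^\varepsilon$ solves (5.1) then $U^\varepsilon(t) = Q^\varepsilon(t) - H^\varepsilon(t)$
satisfies the equation

(5.6)  $$U^\varepsilon(t) - \int_0^t \nabla_x B(\bar Z_x(s), \xi(q_1(s/\varepsilon)), \dots, \xi(q_\ell(s/\varepsilon)))\,U^\varepsilon(s)\,ds = \int_0^t \big( J_1^\varepsilon(s) + J_2^\varepsilon(s) \big)\,ds$$

where

$$J_2^\varepsilon(s) = \big( \nabla_x B(\bar Z_x(s), \xi(q_1(s/\varepsilon)), \dots, \xi(q_\ell(s/\varepsilon))) - \nabla_x \bar B(\bar Z_x(s)) \big)\,H^\varepsilon(s).$$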
By Gronwall's inequality we obtain that 

(5.7) \U e (t)\ < Cte ct f | JJ(s) + Jl(s)\ds 

Jo 

for some $C > 0$ independent of $\varepsilon$ and $t \in [0, T]$.

Thus, in order to prove that $Q^\varepsilon$ converges weakly as $\varepsilon \to 0$
to a Gaussian process $Q^0$ solving (2.14) it suffices to show that
$\int_0^t J_1^\varepsilon(s)\,ds$ and $\int_0^t J_2^\varepsilon(s)\,ds$ converge to
zero in probability as $\varepsilon \to 0$. By (2.7),

$$|Z^\varepsilon_x(t) - Y^\varepsilon(t)| \le C \int_0^t |Z^\varepsilon_x(s) - \bar Z_x(s)|\,ds = C \sqrt{\varepsilon} \int_0^t |Q^\varepsilon_x(s)|\,ds$$

with $C = Kd$, and so

$$|Q^\varepsilon_x(t)| \le |G^\varepsilon(t)| + C \int_0^t |Q^\varepsilon_x(s)|\,ds.$$

Hence, in the same way as in (5.2),

(5.8)  $$|Q^\varepsilon_x(t)| \le |G^\varepsilon(t)| + C e^{Ct} \int_0^t |G^\varepsilon(s)|\,ds.$$

By (2.7) and the Taylor formula with a remainder we conclude that

(5.9)  $$|J_1^\varepsilon(s)| \le C \sqrt{\varepsilon}\,|Q^\varepsilon(s)|^2$$

which together with (4.2) yields that $E|J_1^\varepsilon(s)| \to 0$ as $\varepsilon \to 0$.

The proof of convergence to zero in probability of $\int_0^t J_2^\varepsilon(s)\,ds$
as $\varepsilon \to 0$ is based on the integral representation (5.4). Set

and 

*(a;,£i,...,6) - V x B(z,£i, ...,&) - V XJ B(z). 
Relying on the representation (5.4) we obtain that

(5.10) |£ f J!(s)ds\ < |J 3 e WI + |J 4 £ WI 

Jo 

where 

(5.11) J|W= £ 3 / 2 / t/£ d s /;d u £;(*(Z ;c ( £s ),C(gi( S )),...,efe(s))) 

x<f>(Z x (eu),Z( qi (u)),...,Z(q e (u)))) 

and 

(5.12) Jf(t) = e 3 ' 2 fj e ds f Q s du dvK(es,ev) 

x£;(*(Z a (es),€(9i(«)),...,e(g / («)))$(Z x (e«)^(9i(u)),...^(g i (u)))). 

Estimating the expectations in (5.11) and (5.12) via (3.10) similarly to p,14j) we
obtain that both $J_3^\varepsilon(t)$ and $J_4^\varepsilon(t)$ are of order
$\sqrt{\varepsilon}$, and so the left hand side of (5.10) is of order
$\sqrt{\varepsilon}$, as well. For more details of a similar argument we refer the
reader to [21]. This completes the proof of Theorem 2.2 concerning the continuous
time case.

In the discrete time case the proofs are similar but slightly simpler. Namely, set 

[Ntji] 

(5.13) R\ /N {t) = N- 1 / 2 ]T B t {Z{nt/N)^i(n)),...,^(n))) 

n=0 

where the $B_i$'s are the same as in (2.16)-(2.18). Then for all $N \ge 1$,

(5.14)  $$|G_i^{1/N}(t) - R_i^{1/N}(t)| \le C N^{-1/2}$$

for some $C > 0$ independent of $N$. The asymptotical behavior of $R_i^{1/N}$ as
$N \to \infty$ can be studied in the same way as in [24] taking into account that we
have here slightly different mixing conditions, and so the corresponding estimates
should be done as above via (3.10)-(3.14). The main difference of the discrete
vis-a-vis continuous time case is that now each $G_i^{1/N}(t)$, $i = k+1, \dots, \ell$
converges weakly as $N \to \infty$ to a nondegenerate Gaussian process $G_i^0(t)$
having the covariances

(5.15)  $$E\big( G_i^0(s)\,G_i^0(t) \big) = \int_0^{\min(s,t)} du \int \big( B_i(\bar Z(u), \xi_1, \dots, \xi_i) \big)^2\,d\mu(\xi_1) \cdots d\mu(\xi_i)$$

which is proved combining arguments of Proposition 4.5 in [24] and of Section 4
above. The computation of other limiting covariances proceeds in the same way as
in the continuous time case. It follows that in the discrete time case the processes
$G^\varepsilon$ converge weakly as $\varepsilon \to 0$ to a Gaussian process $G^0$
having the representation

k I 
i=l i=k+l 

where each process $G_i^0$, $i > k$ is independent of each $G_j^0$ with $j \ne i$
while the processes $G_i^0$, $i \le k$ are correlated with covariances described at
the end of Section 4 taken with $\alpha_i = i$, $i = 1, \dots, k$. The argument
concerning the convergence of processes $Q^\varepsilon$ to $Q^0$ solving (2.14)
remains the same as in the continuous time case.
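The discrete time setup can be explored numerically in a toy case. The following Python sketch is only an illustration under assumptions which are not taken from the paper: $\ell = k = 2$ with $q_1(n) = n$ and $q_2(n) = 2n$, the fast sequence consists of independent signs $\pm 1$, and the vector field is the hypothetical choice $B(x, \xi_1, \xi_2) = -x + \xi_1 \xi_2$, so that the averaged field is $\bar B(x) = -x$ and the averaged motion in slow time is $x e^{-t}$. The function names are hypothetical as well.

```python
import math
import random


def simulate_slow_motion(eps, x0, seed):
    """Iterate X(n+1) = X(n) + eps * B(X(n), xi(n), xi(2n)) for n < 1/eps."""
    rng = random.Random(seed)
    n_steps = int(round(1.0 / eps))
    # Assumed fast motion: i.i.d. signs, sampled far enough to evaluate xi(2n).
    xi = [rng.choice((-1.0, 1.0)) for _ in range(2 * n_steps)]
    x = x0
    for n in range(n_steps):
        x += eps * (-x + xi[n] * xi[2 * n])  # toy B(x, a, b) = -x + a * b
    return x


def normalized_error(eps, x0, seed):
    """eps^{-1/2} * (X(1/eps) - averaged motion at slow time 1)."""
    averaged = x0 * math.exp(-1.0)
    return (simulate_slow_motion(eps, x0, seed) - averaged) / math.sqrt(eps)


errors = [normalized_error(1e-3, 1.0, seed) for seed in range(20)]
mean = sum(errors) / len(errors)
spread = math.sqrt(sum((e - mean) ** 2 for e in errors) / len(errors))
print("sample mean and spread of the normalized error:", mean, spread)
```

The normalized errors stay of order one as `eps` decreases, which is the scale on which the Gaussian limit is formulated; the sample here is far too small to test Gaussianity itself.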

6. Some dynamical systems applications 

We start with recalling the setup from [5] and [3]. A $C^2$-diffeomorphism $F$ of
a compact Riemannian manifold $\Omega$ is called partially hyperbolic if there is an
$F$-invariant splitting $E^u \oplus E^c \oplus E^s$ of the tangent bundle of $\Omega$
with $E^u \ne 0$ and constants
$\lambda_1 \le \lambda_2 < \lambda_3 \le \lambda_4 < \lambda_5 \le \lambda_6$,
$\lambda_2 < 1$, $\lambda_5 > 1$ such that $\|dF(v)\| / \|v\|$ is between $\lambda_1$
and $\lambda_2$ on $E^s$, between $\lambda_3$ and $\lambda_4$ on $E^c$ and between
$\lambda_5$ and $\lambda_6$ on $E^u$. Denote by $W^u$ the foliation tangent to $E^u$
and call $S$ a u-set if $S$ belongs to a single leaf of $W^u$. $F$-invariant
probability measures which are absolutely continuous with respect to the volume on
leaves of $W^u$ are called u-Gibbs measures. It is assumed that $F$ has a unique
u-Gibbs measure $\mu^{SRB}$ which is called the Sinai-Ruelle-Bowen (SRB) measure.

An important role in the construction is played by Markov families which are
collections $\mathcal{S}$ of u-sets which cover $\Omega$ and have certain regularity
properties (see [8] and [9]) but we formulate here only their "Markov property"
saying that for any $S \in \mathcal{S}$ there are $S_i \in \mathcal{S}$ such that
$FS = \cup_i S_i$. Now let $\mathcal{S}$ be a Markov family. Following [5] and [5]
we construct on each $S \in \mathcal{S}$ an increasing sequence of $\sigma$-algebras
$\mathcal{F}_n$ in the following recursive way. Let $\mathcal{F}_0 = \{\emptyset, S\}$.
Suppose that $\mathcal{F}_n$ is generated by $\{S_{j,n}\}$ with
$F^n S_{j,n} \in \mathcal{S}$. By the "Markov property" we can decompose
$F^{n+1} S_{j,n} = \cup_i S_{j,i,n}$ and now let $\mathcal{F}_{n+1}$ be generated by
$F^{-n-1} S_{j,i,n}$.

Next, for each $x_1$ and $x_2$ in a u-set $S$ put

$$\rho(x_1, x_2) = \prod_{j=0}^{\infty} \frac{\det(dF^{-1}|E^u)(F^{-j} x_1)}{\det(dF^{-1}|E^u)(F^{-j} x_2)}.$$

Fix $x_0 \in S$ and let $\rho_S(x) = \rho(x, x_0) \big( \int_S \rho(x, x_0)\,dx \big)^{-1}$.
For a Markov family $\mathcal{S}$ and nonnegative constants $R, \alpha$ denote by
$E_1(\mathcal{S}, R, \alpha)$ the set of probability measures $\sigma$ defined for
each continuous function $g \in C(\Omega)$ by

(6.1)  $$\sigma(g) = \int_S g(x)\,e^{G(x)}\,\rho_S(x)\,dx$$



where $S \in \mathcal{S}$ and $G$ is Hölder continuous with the exponent $\alpha$ and
the constant $R$. Denote also by $E = E(\mathcal{S}, R, \alpha)$ the closure of the
convex hull of $E_1(\mathcal{S}, R, \alpha)$. The decay of correlations is measured
in [5] and [5] via a sequence $a(n) \to 0$ as $n \to \infty$ such that for any
$\sigma \in E$ and each Hölder continuous $g$ on $\Omega$,

(6.2)  $$|\sigma(g \circ F^n) - \mu^{SRB}(g)| \le a(n)\,\|g\|$$

where || • || is a Holder norm. An argument from Section 5 of compares 
the coefficient $a(n)$ above with the more familiar rate of decay of correlations
$|\mu^{SRB}(f \cdot (g \circ F^n)) - \mu^{SRB}(f)\,\mu^{SRB}(g)|$ and it follows
from there that the latter decays superpolynomially if and only if $a(n)$ decays
superpolynomially. According to [7] such decay of correlations holds true for $C^2$
Anosov flows with jointly nonintegrable stable and unstable foliations and for their
time-one maps. By [14] this remains true for an open dense set of $C^2$ Axiom A
flows, as well as for their time-one maps. For other partially hyperbolic dynamical
systems with fast decay of correlations see [8], [9], [14] and references there.

In order to estimate $\eta_{p,\kappa,s}(n)$ from (2.3) we write in the same way as
in Lemma 4 from [8] that on each element $S$ in $\mathcal{F}_{[t]}$,

(6.3)  $$A_{n,s,t} = E\big( g(f \circ F^{n+t}, f \circ F^{n+t+s}) \,\big|\, \mathcal{F}_{[t]} \big) = \int_S \rho_S(y)\,g_{s,t}(F^n y)\,dy$$

where the expectation is with respect to $\sigma$ on $S$ and
$g_{s,t}(z) = g(f(F^{t-[t]} z), f(F^{t-[t]+s} z))$. If $f$ and $g$ are Hölder
continuous then $g_{s,t}$ is Hölder continuous for fixed $s$ and $t$ and it is
uniformly in $t$ Hölder continuous when $s = 0$. Thus, by (6.2) we have that
$|A_{n,s,t} - E A_{n,s,t}|$ decays in $n$ with the speed of at least $a(n)$ and this
decay is uniform in $t$ if $s = 0$. Hence, if $a(n)$ decays superpolynomially then
dM|) holds true. This yields Theorem O for $\xi(t) = \xi(t, z) = g(F^t z)$ on a
probability space $(S, \sigma)$ for $\sigma \in E$ and an element $S$ of a Markov
family while $g$ is a Hölder continuous function. We observe that the measure
$\sigma$ here plays the role of the probability $\Pr$ in the setup of Section 2
while $\mu^{SRB}$ plays the role of $P$ there.

7. Concluding remarks: fully coupled averaging 

In the nonconventional framework as discussed in this paper even the setup of
fully coupled averaging, i.e. when the fast motion depends on the slow one, is not
quite clear. At first sight we may want to deal with the equations

(7.1)  $$X^\varepsilon(n+1) = X^\varepsilon(n) + \varepsilon B(X^\varepsilon(n), \xi(n), \xi(2n), \dots, \xi(\ell n)),$$
$$\xi(n+1) = F_{X^\varepsilon(n)}(\xi(n))$$

in the discrete time case and

(7.2)  $$\frac{dX^\varepsilon(t)}{dt} = \varepsilon B(X^\varepsilon(t), \xi(t), \xi(2t), \dots, \xi(\ell t)), \qquad \frac{d\xi(t)}{dt} = b(X^\varepsilon(t), \xi(t))$$

in the continuous time case. The problem is that $\xi(kn)$ or $\xi(kt)$ are not yet
defined for $k > 1$ at time $n$ or $t$ so we cannot insert them into the first
equation in (7.1) or (7.2), respectively, and so these equations do not define
properly $X^\varepsilon$ and $\xi$.
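A reasonable modification of this setup is to consider

(7.3)  $$X^\varepsilon(n+1) = X^\varepsilon(n) + \varepsilon B(X^\varepsilon(n), \eta_1^\varepsilon(n), \eta_2^\varepsilon(n), \dots, \eta_\ell^\varepsilon(n)),$$
$$\eta_i^\varepsilon(n+1) = F^i_{X^\varepsilon(n)}(\eta_i^\varepsilon(n)), \quad i = 1, \dots, \ell$$

in the discrete time case and

(7.4)  $$\frac{dX^\varepsilon(t)}{dt} = \varepsilon B(X^\varepsilon(t), \eta_1^\varepsilon(t), \eta_2^\varepsilon(t), \dots, \eta_\ell^\varepsilon(t)),$$
$$\frac{d\eta_i^\varepsilon(t)}{dt} = i\,b(X^\varepsilon(t), \eta_i^\varepsilon(t)), \quad i = 1, 2, \dots, \ell$$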

in the continuous time case. We consider (7.3) and (7.4) as sets of $\ell + 1$
equations but require that $\eta_1^\varepsilon(0) = \eta_2^\varepsilon(0) = \dots = \eta_\ell^\varepsilon(0)$.
This approach seems to be reasonable if we consider (7.3) and (7.4) as perturbations
of equations with constants of motion

(7.5)  $$\xi^{(x)}(n+1) = F_x(\xi^{(x)}(n)) \quad \text{and} \quad \frac{d\xi^{(x)}(t)}{dt} = b(x, \xi^{(x)}(t)),$$

i.e. when the $x$ variable remains fixed in unperturbed equations but starts moving
slowly in perturbed ones. Then $\eta_i^{(x)}(n+1) = F_x^i(\eta_i^{(x)}(n))$ and
$d\eta_i^{(x)}(t)/dt = i\,b(x, \eta_i^{(x)}(t))$.
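The role of the auxiliary fast motions in the unperturbed discrete time equations can be checked directly: when the slow variable is frozen, the $i$-th copy advanced by the $i$-th iterate of the fast map and started from the common initial point reproduces the original fast motion sampled at times $in$. The Python sketch below illustrates this with a hypothetical integer map standing in for the fast dynamics with frozen slow variable; the map, the modulus and the function names are assumptions made only for the illustration.

```python
MODULUS = 2**31 - 1


def fast_map(x, y):
    """Hypothetical fast dynamics F_x(y) with the slow variable x frozen."""
    return (48271 * y + x) % MODULUS


def iterate(x, y, times):
    """Apply F_x to y the given number of times."""
    for _ in range(times):
        y = fast_map(x, y)
    return y


def xi(x, y0, n):
    """Unperturbed fast motion xi(n) = F_x^n(y0)."""
    return iterate(x, y0, n)


def eta(i, x, y0, n):
    """Auxiliary motion with eta_i(n+1) = F_x^i(eta_i(n)) and eta_i(0) = y0."""
    y = y0
    for _ in range(n):
        y = iterate(x, y, i)  # one step of the i-th iterate of F_x
    return y


x_frozen, y_start = 12345, 678
mismatches = [
    (i, n)
    for i in range(1, 4)
    for n in range(11)
    if eta(i, x_frozen, y_start, n) != xi(x_frozen, y_start, i * n)
]
print("pairs (i, n) where eta_i(n) differs from xi(i n):", mismatches)
```

Integer arithmetic is used so that the comparison is exact; once the slow variable is allowed to move, the copies no longer coincide with a single time-changed trajectory, which is the point of requiring only equal initial conditions.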

As is well known, in the fully coupled setup the averaging principle does not always
hold true and when it takes place then usually only in the sense of convergence in
average or in measure. In the nonconventional situation the problem is even more
complicated. Consider, for instance,

(7-6) =eB{al, v {t)^l a {t) 1 ... 1 ^ a {t)), 

where ip denotes a point on an rt-dimensional torus T™ and a denotes a constant 
n-vector (constant vector field on T"). Then p\ aip — if\ a „ — (i — l)(p. Set 
B(ip,ip) = B(a,ip,ip — ip...,i/> — (£ — l)<y3). Then the right hand side of (|7.6[) can 
be replaced by sB(a e a ip (t), <pl i0l (t), <p). If B(a) = J B(a, (pi, p)dp\dp and = 
B{a a {t)) 1 a a (0) = a then employing the technique from the proof of Theorem 
2.1 in 18 it is not difficult to see that for any compact K, 

(7.7) / sup |a„ it) — a a {st)\dadip -> as s ->• 0. 

JK0<t<T/s 